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Monstrous Moonshine: The first twenty-five years 


TERRY GANNON 
Abstract 


Twenty-five years ago, Conway and Norton published in this journal! their remarkable paper ‘Monstrous Moon- 
shine’, proposing a completely unexpected relationship between finite simple groups and modular functions. This 


paper reviews the progress made in broadening and understanding that relationship. 


1. Introduction 


It has been approximately twenty-five years since John McKay remarked that 
196 884 = 196 883 +1. (1.1) 


That time has seen the discovery of important structures, the establishment of another 
deep connection between number theory and algebra, and a reinforcement of a new era 
of cooperation between pure mathematics and mathematical physics. It is a beautiful 
and accessible example of how mathematics can be driven by strictly conceptual concerns, 
and of how the particular and the general can feed off each other. Now, six years after 
Borcherds’ Fields Medal, the original flurry of activity is over; the new period should be 
one of consolidation and generalisation and should witness the gradual movement of this 
still rather esoteric corner of mathematics toward the mainstream. 

The central question McKay’s equation (1.1) raises, is: What does the j-function (the 
left side) have to do with the Monster finite group (the right side)? Many would argue that 
we still don’t have our finger on the essence of the matter. But what is clear is that we 
understand far more about this central question today than we did in 1978. Today we say 
that there is a vertex operator algebra, called the Moonshine module V4, which interpolates 
between the left and right sides of (1.1): its automorphism group is the Monster and its 
graded dimension is the j-function (—744). 

This paper tries to summarise this work of the past twenty-five years in about as many 
pages. The original article [24] is still very readable and contains a wealth of information 
not found in other sources. Other reviews are [21], [87], [12], [39], [90], [48], [15], [78], 
[102], [16], [46] and the introductory chapter in [44], and each has its own emphasis. Our 
own bias here has been to breadth at the expense of depth, which probably limits this 
review to be a mere annotated sampling of representative literature. 


2. Background 


In §2.1 we describe the finite simple groups and in particular the Monster. In §2.2 we 
focus on the modular groups and functions which arise in Monstrous Moonshine. 


2.1. The Monster. By definition, a simple group is one whose only normal subgroups 
are the trivial ones: {1} and the group itself. The importance of the finite simple groups 
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lies in their role as building blocks, in the sense that any finite group G can be constructed 
from {1} by extending successively by a (unique up to order) sequence of finite simple 
groups. For example the symmetric group S4 arises in this way from the cyclic groups 
C2, C2, C3, Co. 

A formidable accomplishment last century was determining the explicit list of all finite 
simple groups. See e.g. [49] for more details and references. These groups are: 

(i) the cyclic groups Cp, p prime; 

(ii) the alternating groups An, n > 5; 
(iii) 16 infinite families of groups of Lie type; 
(iv) 26 sporadic groups. 


An example of a family of finite simple group of Lie type is PSL,,(F,), i.e. the group 
of n x n matrices with determinant 1, and entries from the finite field F}, quotiented out 
by its centre (the scalar matrices al, where a” = 1). 

The mysteriousness of the sporadics is due to their falling outside those infinite fam- 
ilies. They range in size from the Mathieu group M11, with order 7920 and discovered in 
1861, to the Monster M, with order 


[MI] = 245 . 320 . 59 . 76 . 112. 133-17 - 19 - 23 - 29 - 31 - 41 -47 -59-71 %8 x 1055. (2.1) 


The existence of M was conjectured in 1973 by Fischer and Griess, and finally constructed 
in 1980 by Griess [50]. Most sporadics arise in M (e.g. as quotients of subgroups). We’ll 
encounter many of these sporadics in the coming pages, but most of our attention will be 
directed at M. 

Griess showed in fact that M was the automorphism group of a 196883-dimensional 
commutative nonassociative algebra, now called the Griess algebra, but the construction 
was somewhat artificial. We now understand [44] the Griess algebra as the first nontrivial 
tier of an infinite-dimensional graded algebra, the Moonshine module V”, which lies at 
the heart of Monstrous Moonshine. We’ll discuss V” in §4.2; we will find that it has a 
very rich algebraic structure, is conjectured to obey a strong uniqueness property, and has 
automorphism group M. 

The Monster M has a remarkably simple presentation. As with any noncyclic finite 
simple group, it is generated by its involutions (i.e. elements of order 2) and so will be a 
homomorphic image of a Coxeter group. Let Gpgr, p > q È r È 2, be the graph consisting of 
three strands of lengths p+1,q+1,r+1, sharing a common endpoint. Label the p+q+r+1 
nodes as in Figure 1. Given any graph Gpgr, define Ypqr to be the group consisting of a 
generator for each node, obeying the usual Coxeter group relations (i.e. all generators are 
involutions, and the product gg’ of two generators has order 3 or 2, depending on whether 
or not the two nodes are adjacent), together with one more relation: 


(ab, bgac,c2ad;d2)'° = 1). (2.2) 


The groups Ypqr, for p < 5, have now all been identified (see e.g. [61]). Conway conjec- 
tured and, building on work by Ivanov [60], Norton proved [98] that Ys55 = Y444 is the 
‘Bimonster’, the wreathed-square M ? Co of the Monster (so has order 2|M|?). A closely 
related presentation of the Bimonster has 26 involutions as generators and has relations 
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given by the incidence graph of the projective plane of order 3; the Monster itself arises 
from 21 involutions and the affine plane of order 3 [25]. Likewise, Ys53 = Ya43 = M x Co. 
Other sporadics arise in e.g. Y533 = Y433 (the Baby Monster B), Y552 & Y442 (the Fischer 
group F'i5,), and Ys32 S Y432 (the Fischer group F%23). The Coxeter groups of Gs55, Gs53, 
G533, Gs52, and G53, are all infinite groups of hyperbolic reflections in e.g. R'”!, and con- 
tain copies of groups such as the affine Eg Weyl group, so the geometry here should be 
quite pretty. What role, if any, these remarkable presentations have in Moonshine hasn’t 
been established yet. As a first step though, [97] identifies in Aut(V4) the 21 involutions 
generating M. 


Figure 1. The graph G;;5 presenting the Bimonster 


The Monster has 194 conjugacy classes, and so that number of irreducible represen- 
tations. Its character table (and much other useful information) is given in the Atlas [22], 
where we also find analogous data for the other simple groups of ‘small’ order. For ex- 
ample, we find that M has exactly 2, 3, 4 conjugacy classes of elements of order 2, 3, 4, 
respectively — these classes are named 2A, 2B, 3A, etc. We also find that the dimensions 
of the smallest irreducible representations of M are 1, 196883, 21296876, and 842609326. 
This is the same 196883 as on the right side of (1.1), and as the dimension of the Griess 
algebra. 


2.2. The j-function. The group SL2(R), consisting of 2 x 2 matrices of determinant 
1 with real entries, acts on the upper half-plane H := {r € C|Im(r) > 0} by fractional 


linear transformations 7 
a b aT + 
p= ——— 2.3 
( c 4) ý cT +d (2.8) 


Of course this is really an action of PSL2(R) := SLe(R)/{+J/} on H, but it is more conve- 
nient to work with SL2(R). H is the hyperbolic plane, one of the three possible geometries 
in two dimensions (the others are the sphere and the Euclidean plane), and PSL2(R) is its 
group of orientation-preserving isometries. 

Let G be a discrete subgroup of SL2(R). Then the space G\H has a natural structure 
of an orientable surface, and inherits a complex structure from H (so can be regarded as 
a complex curve). By the genus of the group G, we mean the genus of the resulting real 
surface G\H. For example, the choice G = SL2(Z) yields the sphere with one puncture, so 
SL2(Z) has genus 0. Moreover, any curve © with genus g and n punctures, for 3g+n > 3, is 
equivalent as a complex curve to the space G\H, for some subgroup G of SL2(R) isomorphic 
to the fundamental group 7(%). 


The most important choice for G is SL2(Z), thanks to its interpretation as the modular 
group of the torus. Most groups G of interest are commensurable with SL (Z), i.e. GN 
SL2(Z) has finite index in both G and SL2(Z). Examples of these are the congruence 
subgroups 


T(N) a 1) € $Lo(Z) | & a = € o (mod N)}, Gag 


To(N) =i(¢ D € SL2(Z) | N divides c} . (2.4b) 


For example To(N) has genus 0 for N = 2, 13,25, while N = 50 has genus 2 and N = 24 
has genus 3. The following definition includes all groups arising in Monstrous Moonshine. 


DEFINITION 1. Call a discrete subgroup G of SLə(R) a moonshine-type modular 
group, if it contains some To(N), and also obeys the condition that 


1 ¢ : 
C |) eGiftez. 


Such a modular group is necessarily commensurable with SL2(Z). Note that for such 
a G, any meromorphic function f : G\H — C will have a Fourier expansion of the form 
{WH tng”, where q = <7". 


DEFINITION 2. Let G be any subgroup of SL2(R) commensurable with SL2(Z). By a 
modular function f for G we mean any meromorphic function f : H — C, such that 


Eso v(i bee (2.5) 


and such that, for any A € SLo(Z), the function f(A.T) has Fourier expansion of the form 
yY bg”! for some N and b, (both depending on A), and where b, = O for all but 


n=— o0 


finitely many negative n. 


ar +b 
cT+d 


This definition simply states that f is a meromorphic function on the compact surface 
“a := G\H, where H := HU QU {iœ}. The G-orbits of QU {ico} are called cusps; 
their role is to fill in the punctures of G\H, compactifying the surface, as there are much 
fewer meromorphic functions on compact surfaces than on noncompact ones (compare the 
Riemann sphere to the complex plane!). 

We are especially interested in genus 0 groups G of moonshine-type. Their modular 
functions are particularly easy to characterise: there will be a unique modular function Ja 
for G, with q-expansion of the form 


Ja(t) =a +Y ang"; (2.6) 
n=1 


poly(Ja(r)) ; 
polyJe(r)) 12 Ja. 


This function Jg is called the (normalised) Hauptmodul for the genus 0 group G. For 
example, the modular group SL2(Z) has Hauptmodul 


Jsta(z) (T) = J(T) = q7} + 196884 q + 21493760 q? + 8642909970 g” +--- . (2:7) 


the modular functions for G are precisely the rational functions f(T) = 
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This 196884 is the same as that on the left-side of (1.1). Historically, in place of this 
Hauptmodul was the equivalent 
Ge (O2(7)® + 03(7)® + 64(7)8)3 
: 8n)” 
As we know, there are other genus 0 modular groups. For example the Hauptmoduls 
for To(2), To(13), and To(25), are respectively 


= J(T) +744. 


Ja(T) =q! + 276q — 2048q? + 11202q? — 49152q4 + 18402445 +---,  (2.8a) 
Jig(ty =q! — q +20? + È +H 2g* = 2g? = 9g? a EHH, (2.8b) 
Jə5(T) =q! 5 q+ + qê =. ee 3 ge +q?! +q” =N ERVA 1s (2.8c) 


Thompson [115] proved there are only finitely many modular groups of moonshine- 
type in each genus. Cummins [28] has found all of these of genus 0 and 1. In particular 
there are precisely 6486 genus 0 moonshine-type groups. Exactly 616 of these have Haupt- 
moduls with rational (in fact integral) coefficients, the remainder have cyclotomic integer 
coefficients. There are some natural equivalences (e.g. a Galois action) which collapse this 
number to 371, 310 of which have integral Hauptmoduls. 

In genus > 0, two functions are needed to generate the function field. A complication 
facing the development of a higher-genus Moonshine is that, unlike the situation in genus 
0 considered here, there is no canonical choice for these generators. 

See e.g. [92] for a very readable account of some of the circle of ideas meandering 
through this subsection. Modular functions are discussed in e.g. [75]. 


3. The Monstrous Moonshine conjectures 


The number on the left of (1.1) is the first nontrivial coefficient of the j-function, and 
the numbers on the right are the dimensions of the smallest irreducible representations of 
the Fischer—Griess Monster M. On the one side we have a modular function; on the other, 
a sporadic finite simple group. Moonshine is the explanation and generalisation of this 
unlikely connection. 

But first, why can’t (1.1) merely be a coincidence? This is soon dispelled by comparing 
the next few coefficients of J with the dimensions of irreducible representations of M: 


214 93760 = 212 96876 + 196883 + 1 , (3.1a) 
8642 99970 = 8426 09326 + 212 96876 + 2- 196883 + 2- 1 . (3.1b) 


3.1. The fundamental conjecture of Conway and Norton. The central structure in the 
attempt to understand equations (1.1) and (3.1) is an infinite-dimensional graded module 
for the Monster: 

V=YVEUGEKV8V3E--- . (3.2a) 


If we let pq denote the d-dimensional irreducible representation of M, then the first few 
subspaces will be Vo = p1, Vi = {0}, Vo = pı © p196883, and V3 = pı © Pig6ss3 © P21296876- 
This module is to have graded dimension 


dimy (T) = X q"dim(V,,) = 1 + 196884q? + 214 937604? + --- = qJ (7) . (3.2b) 
n=0 
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Of course, (3.2b) alone certainly doesn’t uniquely determine V, but assume for now this 
V has been found. Thompson [114] suggested studying in addition the graded traces 


T,(r):=q7' S chy, (g)@” (3.2c) 
n=0 


for all g € M, where the chy, are characters. As taking g = 1 recovers J, (3.2c) is a natural 
twist of (3.2b). The functions T are now called the McKay-Thompson series. 

Conway and Norton conjectured [24] that for each element g of the Monster M, Ty is 
the Hauptmodul 


Ja yeg +)  an(g) a” (3.3) 


for a genus 0 subgroup Gg of SL2(R). So for each n the coefficient g > an(g) defines a 
character chy, (g) of M. They explicitly identify each of the groups Gg; these groups each 
contain [9(V) as a normal subgroup, for some N dividing o(g) gcd(24, o(g)) (o(g) is the 
order of g), and the quotient group Gg/To(N) has exponent 2 (or 1). 

Since Ty = Thgn-1 by definition, there are at most 194 distinct McKay—Thompson 
series. All coefficients a,,(g) are integers (as are in fact most entries of the character table 
of M). This implies that T} = T, whenever the cyclic subgroups (g) and (h) are equal. 
In fact, the total number of distinct McKay—Thompson series T} arising in Monstrous 
Moonshine turns out to be only 171. The first 50 coefficients a,(g) of each T} are given 
in [91]. Together with the recursions given in $3.3 below, this allows one to effectively 
compute arbitrarily many coefficients a,(g) of the Hauptmoduls. It is also this which 
uniquely defines V, up to equivalence, as a graded M-module. 

For example, there are two different conjugacy classes of order 2 elements. One of 
these gives the Hauptmodul J> in (2.8a), while the other corresponds to (3.4) below. 
Similarly, (2.8b) corresponds to an order 13 element, but J25 in (2.8c) doesn’t equal any Tọ. 
Recall that there are exactly 616 Hauptmoduls of moonshine-type with integer coefficients, 
so most of these don’t arise as T}. Recently [23], a fairly simple characterisation has 
been found of the groups arising as G, in Monstrous Moonshine. Their proof of this 
characterisation is by exhaustion. 

Conway coined this conjecture Monstrous Moonshine. The word ‘moonshine’ here is 
English slang for ‘insubstantial or unreal’, ‘idle talk or speculation’, ‘an illusive shadow’. It 
is meant to give the impression that matters here are dimly lit, and that [24] is ‘distilling 
information illegally’ from the character table of M. 

Monstrous Moonshine began, unofficially, in 1975 when Andrew Ogg remarked that 
the list of primes p for which the group 


Po) = Colo) (9 I (3.4) 


has genus 0, is precisely equal to the list of primes p dividing the order of M. Indeed, in 
the tables of [24] we find that, for each prime p dividing |M], an element g of M of order 
p is assigned the group Gy = To(p)+. 


3.2. Lie theory and Moonshine. McKay not only noticed (1.1), but also observed that 
j(T)3 = q7? (1 + 248q + 41244? + 347520? +--+). (3.5) 


The point is that 248 is the dimension of the defining representation of the Eg Lie group, 
while 4124 = 3875 + 248 + 1 and 34752 = 30380 + 3875 + 2 - 248 + 1. Incidentally, j3 
is a generating modular function for the genus-0 group ['(3). Thus Moonshine is related 
somehow to Lie theory. 

McKay later found independent relationships with Lie theory [89], [15], [47], reminis- 
cent of his famous A-D-E correspondence with finite subgroups of SU2(C). As mentioned 
earlier, M has two conjugacy classes of involutions. Let K be the smaller one, called ‘2A’ 
in [22] (the alternative, class ‘2B’, has almost 100 million times more elements). The 
product of any two elements of K will lie in one of nine conjugacy classes: namely, 1A, 
2A, 2B, 3A, 3C, 4A, 4B, 5A, 6A, corresponding respectively to elements of orders 1, 2, 2, 
3, 3, 4, 4, 5, 6. It is surprising that, for such a complicated group as M, that list stops 
at only 6 — we call M a 6-transposition group for this reason (more on this in $5.2). The 
punchline: McKay noticed that those nine numbers are precisely the labels of the affine 
Eg diagram (see Figure 2). Thus we can attach a conjugacy class of M to each vertex 
of the ioe diagram. An interpretation of the edges in the Es diagram, in terms of M, is 
unfortunately not known. 


Figure 2. The affine Eg, F4, and Gz diagrams with labels 


We can’t get the affine Ey labels in a similar way, but McKay noticed that an order 
two folding of affine E7 gives the affine Fy diagram, and we can obtain its labels using the 
Baby Monster B (the second largest sporadic). In particular, let K now be the smallest 
conjugacy class of involutions in B (also labelled ‘2A’ in [22]); the conjugacy classes in 


KK have orders 1, 2, 2, 3, 4 (B is a 4-transposition group), and these are the labels of Fi. 


Of course we’d prefer Ez to F, but perhaps that two-folding has something to do with 
the fact that an order-two central extension of B is the centraliser of an element g € M of 
order two. 

Now, the triple-folding of affine Eg is affine Gp. The Monster has three conjugacy 
classes of order three. The smallest of these (‘3A’) has a centraliser which is a triple cover 
of the Fischer group F'i5,.2. Taking the smallest conjugacy class of involutions in F't5,.2, 
and multiplying it by itself, gives conjugacy classes with orders 1, 2, 3 (hence F'%4,.2 is a 


3-transposition group) — and those not surprisingly are the labels of Go! 
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Although we now understand (3.5) (see §4.1) and have proven the fundamental Conway- 
Norton conjecture (see §§4.2—4.4), McKay’s Be: Fi, Gs observations still have no explana- 
tion. In [47] these patterns are extended, by relating various simple groups to the Ez 
diagram with deleted nodes. 


3.3. Replicable functions. There are several other less important conjectures. One 
which played an important role in ultimately proving the main conjecture involves the 
replication formulae. Conway-Norton want to think of the Hauptmoduls T, as being 
intimately connected with M; if so, then the group structure of M should somehow directly 
relate different T}. Considering the power map g +> g” leads to the following. 

It was well-known classically that j(7) has the property that j(pr) +j(3)+ ICT) + 

. PI) is a polynomial in j(T), for any prime p (proof: it’s a modan function for 
SL2(Z), and hence equals a rational function of j(7); since its only poles will be at the 
cusps, the denominator polynomial must be trivial). Hence the same will hold for J. More 
generally, we get 


SS ee) =0.0@); (3.6a) 


d 
ad=n,0<b<d 


where Qn is the unique polynomial for which Q,,(J(7)) —q~” has a q-expansion with only 
strictly positive powers of q. For example, Qo(x) = x? — 2a, and Q(x) = x? — 3a,2 — 3a, 
where we write J(T) =), anq”. The left side of (3.6a) is really a Hecke operator applied 
to J. These equations (3.6a) can be rewritten into recursions such as a4 = a3 + (a? —a1)/2, 
or collected together into the remarkable expression (originally due to Zagier) 


p™ [[0- pre) = F(z) — F(z), (3.60) 
mee 
where p = e?7!?, 
Conway and Norton conjectured [24] that these formulas have an analogue for any 
McKay—Thompson series T,. In particular, (3.6a) becomes 


T(E) = On lTolr)) (3.72) 


ad=n,0<b<d 


where Qn,g plays the same role for T that Qn played for J. These are called the replication 
formulae. Again, these yield recursions like a4(g) = a2(g) + (a1(g)? — a1(g?))/2, or can be 
collected into the expression 


! exp|- bD 2 amn(g 


k>0 mR 


prk gv 


| =) =). (3.7b) 


This looks a lot more complicated than (3.6b), but you can glimpse the Taylor expansion 
of In(1 — pq”) there and in fact for g = 1 (3.7b) reduces to (3.6b). 
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Axiomatising (3.7a) leads to Norton’s notion of replicable function [96], [1]. Write 
f(T) = qt er pi * and replacing each Tyo in (3.7a) with f™, use (3.7a) to 
recursively define each pe, If each f™ has a g-expansion of the form f (m) (7) =q! + 
Yia Bt”) gk — i.e. no fractional powers of q arise — then we call f = f) replicable. 
Equation (3.7a) says the McKay—Thompson series are replicable, and [30] proved that 
the Hauptmodul of any genus 0 modular group of moonshine-type is replicable, provided 
its coefficients are rational. Conversely, Norton conjectured that any replicable function 
with rational coefficients is either such a Hauptmodul, or one of the ‘modular fictions’ 
f(T) =q}, f(r) = ' +q. This conjecture seems difficult and is still open. Incidentally, 
if the coefficients pi) are irrational, then the definition (3.7a) of replicability should be 
modified to include Galois automorphisms (see §8 of [29]). Replication in positive genus 
is discussed in [109]. 

Replication (3.7a) concerns the power map g +> g” in M. Can Moonshine see more 
of the group structure of M? We explored one step in this direction in 83.2, where McKay 
modeled products of conjugacy classes using Dynkin diagrams. A different idea is given 
in §5.1. It would be very desirable to find other direct connections between the group 
operation in M and e.g. the McKay—Thompson series. 


3.4. The Leech lattice and Moonshine. The Leech lattice A = Agq4 is a 24-dimensional 
even self-dual lattice [26] which is to lattices much as the M-module V of (3.2a) turns 
out to be for vertex operator algebras (see §4.2 below). A has no vectors of odd norm, no 
norm-2 vectors, and precisely 196560 norm-4 vectors — a number remarkably close to the 
monstrous 196883. In fact its theta series O,(7) = 0, q""/?, when divided by n(r)4, 
equals J(7) + 24. Is this another example of Moonshine? 

Indeed it is. However we have: 


THEOREM 3. Let L C R” be any n-dimensional positive-definite lattice whose norms 
v-v are all rational. Lett € R” be any vector with finite order in L: 1.e. mt € L for some 
nonzero m E€ Z. Then the theta series 


Orat(T) = ys ee ae 


vEL 


divided by n(T)”, is a modular function for some T(N). 


See e.g. Theorem 20 of [100] for a proof of this classical result. If L is in fact an even 
lattice (i.e. all norms v-v lie in 2Z), we can say more. Let L* := {x € R” |x-L C Z} be the 
dual lattice. It contains L with finite index; write t;+ 2,7 = 1,..., M, for the finitely many 
cosets in L*/L. Define a column vector X(T) with ith component O4, +z(T)/n(T)”. Then 


XL forms a vector-valued modular function for SLə(Z): for any A = i a € SL2(Z), 
_ ,ar+b a 
Xx ( ) = p(A) Xx(7) (3.8) 
cT +d 


for some M-dimensional unitary matrix representation p of SL2(Z). In particular, for the 
Leech lattice L = A, M = 1 and we can quickly identify O,(7) in terms of J(7). Although 
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the 196560 ~ 196884 coincidence is thus trivial to explain, it will turn out to be a very 
instructive example of Moonshine. 

The lattices are related to groups through their automorphism groups, which will al- 
ways be finite for positive-definite lattices. The automorphism group Coo := Aut(A) of 
the Leech lattice has order about 8 x 1018, and is the direct product of Cz with Con- 
way’s sporadic group Co,. Several other sporadics are also involved in Aut(A). To each 
automorphism a € Aut(A), let 6, denote the theta series of the sublattice of A fixed 
by a. [24] also associate to each automorphism a a certain function 74(7) of the form 
II, n(ait)/T], n(0;7) built out of the Dedekind eta. Both ĝa and na are constant on each 
conjugacy class in Aut(A), of which there are 202. [24] remarks that the ratio 00/1 
always seems to equal some McKay—Thompson series T,/.). See also [87]. 

It turns out that this observation isn’t quite correct [74]. For each a € Aut(A), the 
subgroup of SLə(R) which fixes a/na is indeed genus 0, but for exactly 15 conjugacy 
classes in Aut(A), 0a/nq is not the Hauptmodul. 

Similarly, one can ask this for the Eg lattice, whose automorphism group is the Weyl 
group (of order ~ 7 x 108) of the Eg Lie group. The automorphisms a of the Eg lattice 
which yield a Hauptmodul were classified in [19]. 


4. Proof of the Monstrous Moonshine conjectures 


At first glance, the significance of the Moonshine conjectures seems very unlikely: they 
constitute after all a finite set of very specialised coincidences. The whole point though 
is to try to understand why such seemingly incomparable objects as the Monster and the 
Hauptmoduls can be so related, and to try to extend and apply this understanding to other 
contexts. Establishing the truth (or falsity) of the conjectures was merely meant as an aid 
to uncovering the why of Moonshine. Indeed, in achieving this understanding, important 
new algebraic structures were formulated. We will sketch this theory below. 

The main Conway—Norton conjecture was attacked almost immediately. Thompson 
showed [113] (see also [103]) that if g + an(g) is a character for all sufficiently small 
n (apparently n < 1300 is sufficient), then it will be for all n. He also showed that if 
certain congruence conditions hold for a certain number of a,,(g) (all with n < 100), then 
all g ++ an(g) will be virtual characters (i.e. differences of true characters of M). Atkin, 
Fong, and Smith (see [110] for details) used that to prove on a computer that indeed all 
an(g) were virtual characters (they didn’t quite reach n = 1300 though). But their work 
doesn’t say anything more about the underlying (possibly virtual) representation V, other 
than its existence. Their work plays no role in the following. 

We want to show that the McKay—Thompson series T,(7) of (3.2c) equals the Haupt- 
modul Jg, (7) in (3.3). First, we need to construct the infinite-dimensional module V of M. 
We discuss this, and the underlying theory of vertex operator algebras, in §4.2. Borcherds’ 
strategy [11] was to bring in Lie theory, by associating to the module V a ‘Monster Lie 
algebra’. This algebra, and the underlying theory of generalised Kac-Moody algebras, 
is described in §4.3. In the final subsection we go from the Monster Lie algebra to the 
replication formulae, and conclude the proof. We begin this section though by explaining 
the much simpler connection of Eg with j3. 


4.1. Eg and j 3, An explanation for the relation between Eg and 7 3 was found 
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almost immediately, by Kac and Lepowsky [65], [76]: j3 is the (normalised) character of 
a representation of the affine Kac-Moody algebra EY . Given a finite-dimensional simple 
Lie algebra g, the affine algebra g“) is the infinite-dimensional Lie algebra consisting of all 
Laurent polynomials eae ant” where an E g and t is an indeterminate, together with 
a central term and derivation D = —L (see [66], [70]). Highest weight representations 
are defined in the usual way. Thanks largely to the fact that the affine Weyl group is a 
semi-direct product of the additive group Z” (r = rank(g)) with the finite Weyl group, the 
characters of these representations (especially the ‘integrable’ highest weight ones, which 
are the direct analogue of the finite-dimensional representations of g) transform nicely 
with respect to SL2(Z). See e.g. Chapter 13 of [70] for details. This is probably the single 
biggest reason Kac-Moody algebras are so well-known. 


THEOREM 4. [69] Let g be any finite-dimensional Lie algebra, and g denote the 
corresponding affine algebra. Let Pe denote the finitely many ‘level k integrable highest 
weight modules’ Ly of g). The g™ -module Ly has a natural Z-grading Ly = Ore o(Ladn 
into finite-dimensional g-modules (Ly)n. Let xa(T) = q~°/*4 EZ, dim((Ly)n) q” be 
the corresponding normalised character (for some appropriate choice of hy — c/24 € Q). 
Then each x, is a holomorphic function in H, and the vector X(T) with entry xa(T) for 
each A € PË defines a vector-valued modular function for SL2(Z), as in (3.8), for some 
finite-dimensional unitary representation p of SL2(Z). 


In fact each character xa will be a rational function in lattice theta series, and so will 
be a modular function for some T(N). It turns out that there is only one level 1 integrable 


highest-weight representation of EÑ, and its character equals j3. The modularity of j3 
is thus predicted by Kac-Moody theory, and the fact that the coefficients are dimensions 
of Eg representations is automatic. We will see a simultaneous generalisation of Theorems 
3 and 4 next section. 


We’ve already encountered the mysterious normalisation g~! of the McKay—Thompson 
hy—c/24 


series, and the q~3 of the EY) character, and more generally the q of xa. Many 
explanations have been provided for this pervasive factor. For example, to [2] it is topolog- 
ical in origin, and related to the Atiyah—Singer Index Theorem; a geometric interpretation 
using determinant line bundles is due to Segal [107]. In quantum physics it’s called the 
conformal anomaly (a breakdown of manifest conformal symmetry when the classical sys- 
tem is quantised), and is introduced in regularisation as a vacuum energy. Probably the 
simplest instance of it is the prefactor in the familiar definition n(r) = q7 [2,0 -¢”) for 
the Dedekind eta: reading through classical proofs for its modularity we find that a here 
arises through the combination ¢(2)/(27)?; n appears also in physics in the partition func- 
tion of the bosonic string, and that same ‘34’ arises there via regularisation as —¢(—1)/2. 
The equivalence of these two expressions for om comes from the functional equation of the 
Riemann zeta. This same zeta value appears famously in the central term of the Virasoro 
algebra (4.3), and Bloch [8] found other zeta values appearing in other algebras of differ- 
ential operators, many of which have now been interpreted and generalised (starting with 
[77]) within the vertex operator algebra framework. 

Although a direct explanation for Monstrous Moonshine using affine algebras has never 
been found (and certainly isn’t expected), the theory of Kac-Moody algebras influenced 
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every stage of the ultimate proof. For this reason we'll briefly sketch their theory. A 
simple finite-dimensional Lie algebra g is built out of the 3-dimensional algebra slo, in a 
simple way; the Dynkin diagram of g encodes the exact presentation. In the identical way, 
Kac—Moody algebras are also built out of copies of slg — the only difference is that the 
finite-dimensionality constraint (a positive-definiteness condition on the Cartan matrix) is 
lifted. Their structure is completely analogous to that of the simple Lie algebras: e.g. it 
has a grading by roots into finite-dimensional spaces; it has a triangular decomposition 
(making Verma modules possible); it has an invariant symmetric bilinear form. See e.g. 
[66], [70] for details. The affine algebras are the class of Kac-Moody algebras especially 
analogous to the finite-dimensional ones. 


4.2. The Moonshine module V”. A vital component of the Monstrous Moonshine 
conjectures came a few years after [24]. In a deep work, Frenkel-Lepowsky—Meurman 
[43], [44] constructed a graded infinite-dimensional representation V4 of M and conjectured 
(correctly) that it is the representation V in (3.2a). V” has a very rich algebraic structure: 
it is in fact a vertex operator algebra! 

A vertex operator algebra [9], [44], [67], [39], [79] is an infinite-dimensional vector 
space V with infinitely many heavily constrained bilinear products ux, v. The name means 
‘algebra of (generalised) vertex operators’; vertex operators are formal differential oper- 
ators which originally appeared in physics as quantum fields describing the creation and 
propagation of physical strings (see §6 below), and were constructed later but indepen- 
dently by Lie theorists (starting with Lepowsky and Wilson) to realise affine Kac-Moody 
algebras as algebras of differential operators. Because there were vertex operator construc- 
tions associated to lattices, affine algebra modules, and string theory, and all of these have 
connections to modular functions, it was natural to use vertex operators to try to construct 
the M-module V of (3.2a). 

The definition of vertex operator algebra (VOA) is too complicated to give in detail 
here. A VOA is a graded infinite-dimensional vector space V = @?@9Vn, where each V, is 
finite-dimensional. To simplify the discussion, we will limit ourselves in this paper to VOAs 
with one-dimensional Vo, which is typical of the examples relevant to Moonshine (and 
conformal field theory). To each vector v € V we assign a vertex operator Y (v, z), which is 
a formal power series Y (v, z) = Yi mez GC a ceaiae with coefficients v(m) € End(V). The 
vertex operator is just the generating function for the products: u *n v = u(n) (v). These 
products respect the grading — in particular, 


Vr *n Ve © Vere-n-1 - (4.1) 


A key axiom, which collects together all the identities obeyed by the products u *,, v, can 
be written as 


(z — w)™ [Y (u, z), Y(v, w)] = 0 Vu,verv, (4.2a) 


for some integer M (depending on u, v), where the bracket in (4.2a) means the commutator 
Y(u, z) Y(v,w) —Y(v, w) Y(u, z). This strange-looking formula really says that each such 
commutator is a linear combination of Dirac deltas and their derivatives, all centred at z = 
w (see e.g. Corollary 2.2 in [67]). Equation (4.2a) implies more down-to-earth identities, 
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such as j 
= _1)\t f .— (1% , ; 
(uev)n = zi 1) (‘) (tpg © Unyi — (—1) Ve4n—i © ui) . (4.2b) 
i>0 
There are two distinguished elements in V: the identity 1 € Vo (so Vo = C1) and 
the conformal vector w € V2. The identity obeys Y(1,z) = id, i.e. 1(n)v = bn,-1v. More 
interesting is the conformal vector: writing Ln = w(n+1), the operators Ln are required to 
form a representation on VY of the Virasoro algebra: 


3 
(ie hw Laat m, -n — z cidy , (4.3) 


for some number c € R (an important numerical invariant of V) called the rank or central 
charge of the VOA. In addition we require Lou = nu whenever u € Vp, and L_, acts on V 
as a derivation. 

The appearance here of the Virasoro algebra is fundamental. It is the unique nontrivial 
central extension of the (polynomial) vector fields on S1, which in turn is the Lie algebra 
associated to the group Diff(S+) of diffeomorphisms of the circle. 

The notion of a VOA may seem very arbitrary, but as we’ll mention in §6 it is the 
‘chiral algebra’ of a conformal field theory. The simplest (and least interesting) special case 
of a VOA occurs when M = 0 in (4.2a) — i.e. when all vertex operators commute. Then 
it is not hard to see that, for each choice of z 4 0, V would be a commutative associative 
algebra with unit, whose product is given by u x, v := Y(u,z)v. A more honest way to 
motivate VOAs has been suggested by Huang: binary trees can be used to keep track of the 
brackets in nested products, e.g. a((bc)d), and e.g. Lie algebras can be easily formulated 
using this language [58]; in a monumental work [59], Huang ‘two-dimensionalised’ this Lie 
algebra formulation by replacing binary trees with spheres with tubes, and showed that 
the result is equivalent to a VOA. 

A relation between VOAs and Lie algebras also exists at a more elementary level. 
Placing / = n = 0 in (4.2b), and evaluating it on the right by w € V, gives 


(uov)ow = Uo(vow) — voluow) . (4.4) 


Writing [xy] for xoy, this becomes the Lie algebra Jacobi identity, at least when [xy] = 
—lyx]. There are different ways to obtain from this a true Lie algebra. The simplest 
is that V, will be a Lie algebra for that bracket; moreover, the number (u,v) defined by 
u,v = (u,v) 1 is an invariant bilinear form for this Lie algebra (by (4.2b) with £ = 0,n = 1). 
Equation (4.4) tells us each V, is a Vj-module, and in fact e” will be an automorphism of 
V for any u € Vı. In the most common examples, Vı will be reductive (i.e. a direct sum 
of simple and abelian Lie algebras). 

Modules M of V can be defined in the obvious way [42], [79] — e.g. for each u € V, each 
Un) Will be in End(M). All (irreducible) modules M come with a Z-grading M = O72.9Mn, 
where upM, C Mn+e-k—1 for all u € Ve, and Lor = (n+h)x for any x € My, where h is 
some number (the conformal weight) depending only on M. The (normalised) character 
of M is 


xu (T) = qo! Teg” = gence X dim(Mn) q” f 


n=0 
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It takes a little effort to construct even the simplest examples of VOAs. The best- 
behaved ones are called rational VOAs [124] (borrowing on terminology from physics) and 
have only finitely many irreducible modules. A rational VOA is associated to any even 
positive-definite lattice L, and their modules are in one-to-one correspondence with the 
cosets L*/L. Another important example: to any affine nontwisted Kac—Moody algebra 
and choice of positive integer k (the ‘level’), the highest weight module Lks has a natural 
VOA structure, and its modules are precisely the affine algebra modules L for each highest 
weight À € pE, 

One of the deepest results in the theory of VOAs is due to Zhu: 


THEOREM 5. [124] Let V be a rational VOA. Its characters xm(T) are holomorphic 
in H, and the subspaces M,, carry representations for Aut(V). Write Xy(T) for the vector 
whose components are the characters Xm(T) of irreducible modules M. Then Xy is a 
vector-valued modular function for SL2(Z). 


It is believed that the characters xm (T) themselves will be modular functions for some 
T(N); significant progress towards this was made in [5] (see also [71]). The proof of Zhu’s 
Theorem is much more difficult than that of Theorem 4, which it generalises. 

The automorphism group Aut(V) is by definition required to fix w, which is why it 
respects the grading of V. Aut(V) is how group theory impinges on VOA theory. Since the 
automorphism group Aut(V) of a VOA contains e”! as a (normal) subgroup, Aut(V) can 
be finite only when Y; = 0. Zhu’s Theorem tells us that Moonshine (without the genus-0 
aspect) will hold between the group Aut(V) and the functions y,,(7), for any rational 
VOA. 

The most famous example of a VOA is the Moonshine module V* of [44]. It is the 
orbifold of the Leech lattice VOA Va by the +1-symmetry of A, which means it’s the direct 
sum of two parts: an invariant part V} and a twisted part vi (more on this in 85.1). The 
orbifold serves two purposes: it removes the constant term ‘24’ from the graded dimension 
J + 24 (hence the subspace (Vq)1) of Va; and it enhances the symmetry from the discrete 
part of Aut(V,), which is an extension of Cog by (C2)*", to all of M. 

A major claim of [44] was that V” is a ‘natural’ structure (hence their notation). Even 
so, this bipartite structure to V? complicates its study. We have vë = Cl, as usual, but 
the Lie algebra Vı = {0} is trivial. For any such VOA, the space V2 will be a commutative 
nonassociative algebra with product u x v := u,v and identity 4w. For the Moonshine 
VOA V4, this can be shown (with effort!) to be the 196883-dimensional Griess algebra 
extended by an identity element. From this, we find the automorphism group of V! to 
be the Monster M. The only irreducible module for V” is itself — such a VOA is called 
holomorphic. Together with Zhu’s Theorem, this implies that its character, namely J(7), 
must be a modular function for SL2(Z) (strictly speaking, we only get invariance up to a 
1-dimensional character of SL2(Z), but it is easy to show that character must be identically 
1). We’ll see in §5.1 how to obtain the other McKay~Thompson series from V4. 

Conjecturally, there are 71 holomorphic VOAs with rank c = 24 [106]. Much as the 
Leech lattice is the unique even self-dual positive-definite lattice of dimension 24 contain- 
ing no norm-2 vectors [26], the Moonshine module V® is conjecturally [44] the unique 
holomorphic VOA with c = 24 and with trivial V,. Thus, just as the Leech lattice is the 
unique lattice with theta series O4, so (conjecturally) is the Moonshine module the unique 
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holomorphic VOA with (normalised) graded dimension J. Proving this is one of the most 
important (and difficult) challenges in the subject. 


4.3. The Monster Lie algebra m. To show that all of the McKay—Thompson series T% 
are indeed Hauptmoduls, Borcherds needed identities satisfied by their q-expansions. He 
obtained these through a Lie algebra he associated to V". Before discussing it, let’s briefly 
describe Borcherds’ generalisation of Kac-Moody algebras [10]. 

A Borcherds—Kac—Moody algebra differs from a Kac-Moody algebra in that it is 
built up from Heisenberg algebras as well as slo, and these subalgebras intertwine in more 
complicated ways. Nevertheless much of the theory for finite-dimensional simple Lie al- 
gebras continues to find an analogue in this much more general setting (e.g. root-space 
decomposition, Weyl group, character formula,...). This unexpected fact is the point of 
Borcherds-Kac—Moody algebras. For reasons of space we avoid giving here the fairly simple 
definition, but for this and much more see the review articles [53], [63], [102]. 

Their basic structure theorem is that of Kac-Moody algebras. In particular, there is a 
grading by roots into finite-dimensional spaces (except that the 0-graded piece, correspond- 
ing to the Cartan subalgebra, may be infinite-dimensional). They also have a triangular- 
isable decomposition and an invariant symmetric bilinear form. Indeed, these structural 
properties characterise Borcherds-—Kac—Moody algebras. In this sense Borcherds—Kac— 
Moody algebras are the ultimate generalisation of simple Lie algebras, in that any further 
generalisation would lose some basic structural ingredient. 

In short, Borcherds’ algebras strongly resemble the Kac-Moody ones and constitute a 
natural and nontrivial generalization. The main differences are that they can be generated 
by copies of the 3-dimensional Heisenberg algebra as well as sly, and that there can be 
imaginary simple roots. Borcherds introduced these algebras and developed their theory 
in order to understand the Monster Lie algebra m. 

We want to construct m from the Moonshine module V! = Vv ® VE D- For later 
convenience, relabel its subspaces V* := Viis Of course the obvious choice VE = V” is 
0-dimensional, so we must modify V” first. Let II 1,1 denote the even self-dual indefinite 
lattice consisting of all pairs (m,n) € Z? with inner product (m, n): (m, n’) = mn! +nm’. 
Because it is indefinite, the usual construction of a VOA from a lattice will fail here to 
produce a true VOA, but most properties will be obtained. Call this near-VOA, V1.1. 

The Monster Lie algebra m is a Lie algebra associated to the near-VOA V! Q Vit 
— see [11] for the details. m inherits a II ı-grading from V1, and this is its root 
space decomposition: the (m,n) root space is isomorphic (as a vector space) to V™”, if 
(m,n) Æ (0,0); the (0,0) piece is isomorphic to R?. Structurally, the Monster Lie algebra 
has a decomposition m = ut @ gl, ® uT into a sum of Lie subalgebras, where u®™ are free 
Lie algebras (see e.g. [64]). It inherits the action of M from V". 

This construction of m may seem indirect; an alternate approach, anticipated in [11] 
and [12], uses Moonshine cohomology [81] — a functor, inspired by BRST cohomology in 
conformal field theory, assigning to certain c = 2 near-VOAs some Lie algebra carrying an 
action of M. To Vı,ı this functor associates m. 


4.4. Denominator identities and modular equations. It was discovered early on that 
the Hauptmoduls all obey the replication formulae, and that anything obeying those for- 
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mulae will be determined by their first few coefficients. The idea then is to show that 
the McKay~Thompson series T} of (3.2c) also are replicable. Borcherds did this using Lie 
algebra denominator identities [11]. 

Finite-dimensional simple Lie algebras g possess a very useful formula for their char- 
acters, due to Weyl: the (formal) character x, of a module Ly equals 


ee e(w) ee?) 


xa = J dim(La(u)) et = e? SESE 


H 


(4.5) 


where W is the Weyl group, A, the positive roots, e(w) = det(w) is a sign, and where 
@,L(p) is the weight-space decomposition of Là. As the weights u by definition lie in the 
dual h* of the Cartan subalgebra of g, the character x) can be regarded as a complex-valued 
function on the space h = C” (r = rank(g)). 

Consider the trivial representation: i.e. x + O for all x € g. Its character yo will 
be identically 1. Thus the character formula (4.5) tells us that a certain alternating sum 
over a Weyl group, equals a certain product over positive roots. These formulas, called 
denominator identities, are nontrivial even in this finite-dimensional case. 

In a famous paper [83], Macdonald generalised the denominator identity for (4.5), 
to infinite sum/product identities, corresponding to the extended Dynkin diagrams. The 
simplest one was known classically as the Jacobi triple product identity: 


DY (ire y= [] a-2 1-2 yaa ty). (4.6) 


Macdonald’s identities were later reinterpreted, by Kac and Moody, as denominator identi- 
ties for the affine algebras. For example, we now know (4.6) to be the denominator identity 
for the algebra AW, 

In particular, the same formula (4.5) holds for Kac-Moody algebras, except that the 
sum and product are now infinite, the positive roots now come with multiplicities, and 
the characters are usually normalised by a prefactor g”—°/24. The variable r in Theorem 
4 is one of the coordinates in the Cartan subalgebra C’*? of the affine algebra (see e.g. 
equation (13.2.4) of [66]). In that theorem we dropped the remaining variable dependence 
of the y, for readability, although those additional coordinates serve the important role of 
guaranteeing linear independence of the characters, and of giving us an action of SL2(Z) 
rather than merely PSL2(Z). 

Because a Borcherds-Kac—Moody algebra g is triangularisable, highest weight g- 
modules can be defined in the usual way from Verma modules. The character formula 


becomes 5 

ew elw) w(e TeSa) 
Iben aa e72 ate , 
where 5) is a correction factor due to imaginary simple roots. 


The corresponding denominator identity of the Monster Lie algebra m can be com- 
puted, and is given in (3.6b). Its Weyl group is C2 and sends the (m, n)-root space to 


Cea’ (4.7) 
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(n,m); the (m,n) root has multiplicity given by coefficient amn of J; for each n > 0 we 
have an imaginary simple root (1,7) with multiplicity an. Because of a cohomological in- 
terpretation of all denominator identities, (3.6b) can be ‘twisted’ by each g € M, and this 
gives (3.7b). These formulas are equivalent to the replication formula (3.7a) conjectured 
in §3.3. 

Identities equivalent to (3.7b) were obtained by more elementary means — i.e. meth- 
ods requiring less of the theory of Borcherds-Kac-Moody algebras — in [64] and [68], 
permitting a simplification of Borcherds’ proof at this stage. 

Now, it turns out that if we verify for each conjugacy class K, of M that the first, 
second, third, fourth and sixth coefficients of the McKay—Thompson series T} and the 
corresponding Hauptmodul Jg, agree, then indeed T} = Jg,. That is precisely what 
Borcherds then did: he compared finitely many coefficients, and as they all equalled what 
they should, this concluded the proof [11] of Monstrous Moonshine! 


However, this case-by-case verification occurred at the critical point where the McKay— 
Thompson series were being compared directly to the Hauptmoduls, and so provides little 
insight into why the T, are genus 0. Fortunately a more conceptual explanation of their 
equality has since been found. 

A function f obeying the replication formulae (3.7a) will also obey modular equations 
— i.e. a 2-variable polynomial identity satisfied by f(x) and f(nx). The simplest examples 
come from the exponential and cosine functions: note that for any n > 0, exp(nz) = 
(exp(xz))” and cos(na) = T;,,(cos(x)) where T, is a Tchebychev polynomial. It was known 
classically that j (hence J) satisfied a modular equation for any n: e.g. put X = J(r) and 
Y = J(2r), then 


(X?—Y)(Y? — X) =393768 (X? + Y?) + 42987520 XY + 40491318744 (X +Y) 
— 120981708338256 . 


The only functions f(T) = q7t + arq +--+ which obey modular equations for all n, 
are J(T) and the ‘modular fictions’ q~' and q~! + q (which are essentially exp, cos, and 
sin) [72]. More generally, we have: 


THEOREM 6. [29] A function B(T) = q7'+3->~~, bng” which obeys a modular equation 
for alln =1 (mod N), will either be of the form B(T) = q7! +bıq, or will be a Hauptmodul 
for a modular group of moonshine-type. 


The converse is also true [29]. The denominator identity argument tells us each Tg 
obeys a modular equation for each n = 1 modulo the order of g, so Theorem 6 then 
concludes the proof of Monstrous Moonshine. 

The computer searches in [20] suggest that the hypothesis of Theorem 6 may be 
considerably weakened, perhaps all the way down to the existence of modular equations 
for any two distinct primes. 


5. Further developments 


5.1. Orbifolds. About a third of the McKay—Thompson series T} will have some neg- 
ative coefficients. In §5.4 we’ll see Borcherds interpret them as dimensions of superspaces 
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(which come with signs). In an important announcement [97], on par with [24], Norton 
proposed that, although T,(—1/7) will not usually be another McKay~Thompson series, 
it will always have nonnegative integer q-coefficients, and these can be interpreted as ordi- 
nary dimensions. In the process, he extended the g — Ty assignment to commuting pairs 
(g,h) E M x M. 

In particular, to each such pair we have a function N(g,h;7), which we will call a 
Norton series, such that 


arc bid. = GF 0 a b 
NH ghir) =a Niga =) v( G)eSlalZ), 6D 


for some root of unity a (of order dividing 24, and depending on g, h, a,b,c,d). The 
Norton series N (g, h; T) is either constant, or generates the modular functions for a genus-0 
subgroup of SL2(Z) containing some T(N) (but otherwise not necessarily of moonshine- 
type). Constant N(g,h;T) arise when all elements of the form g%h? (for gcd(a,b) = 1) 
are ‘non-Fricke’ (an element g € M is called Fricke if the group G, contains an element 
sending 0 to ioo — the identity 1 is Fricke, as are 120 of the 171 G,). Each N (g, h; T) has 
a q7 -expansion for that N; the coefficients of this expansion are characters evaluated at h 
of some central extension of the centralizer Cm(g). Simultaneous conjugation of g, h leaves 
the Norton series unchanged: N(aga™t,aha™t; T) = N(g, h; T). 

For example, when (g, h) = C2 x C2 and g, h, gh are all in class 2A, then N(g, h; T) = 
\/ J(T) — 984. The McKay-Thompson series are recovered by the g = 1 specialisation: 
N(1,h;7) = Ta(T). This action (5.1) of SL2(Z) is related to its natural action on the 
fundamental group Z? of the torus, as we’ll see in §6, as well as a natural action of the 
braid group, as we’ll see next subsection. Norton arrived at his conjecture empirically, by 
studying the data of Queen (see §5.3). 

The basic tool we have for approaching Moonshine conjectures is the theory of VOAs, 
so we need to understand Norton’s suggestion from that point of view. For reasons of 
space, we'll limit this discussion to V', but it generalises. Given any automorphism g € 
Aut(V4), we can define g-twisted modules in the obvious way [36]. Then for each g € M, 
there is a unique g-twisted module, call it V5(g), for V! — this statement generalises 
the holomorphicity of V” mentioned in §4.2. More generally, given any automorphism 
h € Aut(V4) commuting with g, h will yield an automorphism of V’ (g), so we can perform 
Thompson’s twist (3.2c) and write 


qo Tryacgyh q™? =: Z(g, hit) . (5.2) 


These Z(g,h)’s can be thought of as the building blocks of the graded dimensions of 
various eigenspaces in V4(g): e.g. if h has order m, then the subspace of V5(g) fixed by 
automorphism h will have graded dimension m~t X:+; Z(g, h’). In the case of the Monster 
considered here, we have Z(g,h) = N(g, h). 

The important paper [36] proves that, whenever the subgroup (g, h) generated by g 
and h is cyclic, then N(g,h) will be a Hauptmodul satisfying (5.1). One way this will 
happen of course is whenever the orders of g and h are coprime. Extending [36] to all 
commuting pairs g, h is one of the most pressing tasks in Moonshine. 
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This orbifold construction is the same as was used to construct V" from Va: V” is the 
sum of the ‘.’-invariant subspace vi of Va with the ‘v’-invariant subspace V! of the unique 
‘—1’-twisted module for Va, where . € Aut(A) is some involution. The graded dimensions 
of Vi are 2~!(Z(+1,1) + Z(+1, )), respectively, and these sum to J. 

The orbifold construction is also involved in an interesting reformulation of the Haupt- 
modul property, due to Tuite [116]. Assume the uniqueness conjecture: V4 is the only 
VOA with graded dimension J. He argues from this that, for each g € M, T} will be a 
Hauptmodul iff the only orbifolds of V? are Va and V" itself. In e.g. [62], this analysis is 
extended to some of Norton’s N(g, h)’s, where the subgroup (g, h) is not cyclic (thus going 
beyond [36]), although again assuming the uniqueness conjecture. 


5.2. Why the Monster? That M is associated with modular functions can be explained 
by it being the automorphism group of the Moonshine VOA V%. But what is so special 
about this group M that these modular functions T} and N (g, h) should be Hauptmoduls? 
This is still open. One approach is due to Norton, and was first (rather cryptically) stated 
in [97]: the Monster is probably the largest (in a sense) group with the 6-transposition 
property. Recall from 83.2 that a k-transposition group G is one generated by a conjugacy 
class K of involutions, where the product gh of any two elements of K has order < k. For 
example, taking K to be the transpositions in the symmetric group G = Sn, we find that 
Sn is 3-transposition. 

A transitive action of T := PSL2(Z) on a finite set X with one distinguished point 
£o E€ X, is equivalent to specifying a finite index subgroup Io of I. In particular, Io is the 
stabiliser {g € T | g.£o = xo} of zo, X can be identified with the cosets [o\P, and xo with 
the coset lo. (If we avoid specifying xo, then To will be identified only up to conjugation.) 

To such an action, we can associate an interesting triangulation of the closed surface 
To\H, called a (modular) quilt. The definition, originally due to Norton and further devel- 
oped by Parker, Conway, and Hsu, is somewhat involved and will be avoided here (but see 
especially Chapter 3 of [57]). It is so-named because there is a polygonal ‘patch’ covering 
every cusp of I'9\H, and the closed surface is formed by sewing together the patches along 
their edges (‘seams’). There are a total of 2n triangles and n seams in the triangulation, 
where n is the index ||['9\I|| = ||X||. The boundary of each patch has an even number of 
edges, namely the double of the corresponding cusp width. The familiar formula 


n nə n3 Noo 
Sie ah ae 
ae Rg 


for the genus y of To\H in terms of the index n and the numbers n; of To-orbits of fixed- 
points of order 7, can be interpreted in terms of the data of the quilt (see (6.2.3) of [57]), 
and we find in particular that if every patch of the quilt has at most 6 sides, then the 
genus will be 0 or 1, and genus 1 only exceptionally. 

In particular, we’re interested in one class of these T-actions (actually an SL2(Z)- 
action, but this doesn’t matter). Recall that the braid group B3 has presentation 


(01, 02 | 010201 = 020102) , (5.3a) 
and centre Z = ((010201)") [7]. It is related to the modular group by 
B3/Z = PSLo(Z) , Bs/((o10201)*) S SLo(Z) . (5.30) 
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Fix a finite group G (we’re most interested in the choice G = M). We can define a 
right action of Ba on triples (g1, 92,93) € G® by 


(91, 92,93)01 = (919297 ', 91; 93) , (91,92, 93)02 = (91, 929395 +92) - (5.4a) 


We will be interested in this action on the subset of G? where all g; € G are involutions. 
The action (5.4a) is equivalent to a reduced version, where we replace (g1, 92,93) with 
(9192, 9293) E G*. Then (5.4a) becomes 


(9, h)or = (g, gh) ’ (9, h)o2 = (ghh) g (5.4b) 


These B, actions come from specialisations of the Burau and reduced Burau representa- 
tions [7], respectively, and generalise to actions of B, on G” and G”7t. We can get an 
action of SL2(Z) from the B3 action (5.4b) in two ways: either 

(i) by restricting to commuting pairs g, h; or 

(ii) by identifying each pair (g, h) with all its conjugates (aga~', aha~?). 
Norton’s SL2(Z) action of §5.1 arises from the B3 action (5.4b), when we perform both (i) 
and (ii). 

The quilt picture was designed for this SL2(Z) action. The point of this construction 
is that the number of sides in each patch is determined by the orders of the corresponding 
elements g,h. If G is say a 6-transposition group (such as the Monster), and we take the 
involutions g; from 2A, then each patch will have < 6 sides, and the corresponding genus 
will be 0 (usually) or 1 (exceptionally if at all). In this way we can relate the Monster with 
a genus-0 property. 

Based on the actions (5.4), Norton anticipates some analogue of Moonshine valid 
for noncommuting pairs. CFT considerations (‘higher genus orbifolds’) alluded to in §6 
suggest that more natural should be e.g. quadruples (g, g’, h, h’) € Mt obeying ghg~'h7! = 
Wola tg'— 

An interesting question is, how much does Monstrous Moonshine determine the Mon- 
ster? How much of M’s structure can be deduced from e.g. McKay’s Eg Dynkin diagram 
observation, and/or the (complete) replicability of the T,, and/or Norton’s conjectures in 
85.1, and/or Modular Moonshine in §5.4 below? A small start toward this is taken in [99], 
where some control on the subgroups of M isomorphic to Cp x Cp (p prime) was obtained, 
using only the properties of the series N(g,h). For related work, see Chapter 8 of [57]. 


5.3. Other finite groups. It is natural to ask about Moonshine for other groups. 
Indeed, the Hauptmodul for I'9(2)+ looks like 


q |! + 4372q + 96256q7 + 12 400024? + --- (5.5a) 
and we find the relations 
4372 = 4371 + 1, 96256 = 96255 + 1, 12 40002 = 11 39374 + 4371 + 2-1 , (5.5b) 


where 1, 4371, 96255, and 1139374 are all dimensions of irreducible representations of the 
Baby Monster B. Thus we find Moonshine for B! We will return to this example shortly. 
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Of course any subgroup of M automatically inherits Moonshine by restriction, but 
obviously this isn’t interesting. Most constructions of the Leech lattice start with Mathieu’s 
sporadic Mp, (see e.g. Chapters 10 and 11 of [26]), and most constructions of the Monster 
involve the Leech lattice. Thus we are led to the following natural hierarchy of (most) 
sporadics: 

(1) Mə4 (from which we can get M11, M2, M22, Mo3); which leads to 
(2) Cog = Ca x Co, (from which we get HJ, HS, McL, Suz, Co3, Coz); which leads to 
(3) M (from which we get He, Fig2, Fi23, Fiha, HN, Th, B). 

It can thus be argued that we could approach problems in Monstrous Moonshine, by 
first addressing in order Mə4 and Co,, which should be much simpler. Indeed, the full 
VOA orbifold theory — i.e. the complete analogue of §5.1 — for Mə4 has been established 
in [38] (the relevant series Z(g, h) had already been constructed in [88]). 

Largely by trial and error, Queen [101] established Moonshine for the following groups 
(all essentially centralisers of elements of M): Coo, Th, 3.2.Suz, 2.HJ, HN, 2.47, He, Mie 
(by e.g. ‘2.HJ’ we mean C2 is normal and HJ is the quotient 2.HJ/C2). In particular, to 
each element g of these groups, there corresponds a series Qg(T) = q7! + Yoo an(g)g”, 
which is a Hauptmodul for some modular group of moonshine-type, and where each g +> 
an(g) is a virtual character. For Th, HN, He and Mjg it is a proper character. Other 
differences with Monstrous Moonshine are that there can be a preferred nonzero value for 
the constant term ao, and that although To(N) will be a subgroup of the fixing group, it 
won’t necessarily be normal. We will return to these results next section, where we will see 
that many seem to come out of the Moonshine for M. About half of Queen’s Hauptmoduls 
Qg for Coo do not arise as a McKay—Thompson series for M. Norton’s conjectures in 85.1 
are a reinterpretation and extension of Queen’s work. 

Queen never reached B because of its size. However, the Moonshine (5.5) for B falls 
into her and Norton’s scheme because (5.5a) is the McKay—Thompson series associated to 
class 2A of M, and the centraliser of an element in 2A is a double cover of B. 

There can’t be a VOA V = @,,V, with graded dimension (5.5a) and automorphisms 
in B, because e.g. the B-module V3 doesn’t contain V2 as a submodule. However, Hohn 
deepened the analogy between M and B by constructing a vertex operator superalgebra 
VB’ of rank c = 23.5, called the shorter Moonshine module, closely related to V5 (see e.g. 
[56]). Its automorphism group is C2 x B. Just as M is the automorphism group of the 
Griess algebra VA, so is B the automorphism group of the algebra (VB')2. Just as V” is 
associated to the Leech lattice A, so is VB’ associated to the shorter Leech lattice O23, 
the unique 23-dimensional positive-definite self-dual lattice with no vectors of length 2 or 
1 (see e.g. Chapter 6 of [26]). The automorphism group of O23 is C2 x Coz. 

There has been no interesting Moonshine rumoured for the remaining six sporadics 
(the pariahs Jı, J3, Ru, ON, Ly, J4). There will be some sort of Moonshine for any group 
which is an automorphism group of a vertex operator algebra (so this means any finite 
group [37]!). Many finite groups of Lie type should arise as automorphism groups of VOAs 
associated to affine algebras except defined over finite fields. But apparently all known 
examples of genus-O Moonshine are limited to the groups involved with M. 


5.4. Modular Moonshine. Consider an element g € M. We expect from [101], [97], 
[36] that there is a Moonshine for the centraliser Cy(g) of g in M, governed by the g- 
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twisted module V5(g). Unfortunately, V4(g) is not usually itself a VOA, so the analogy 
with M is not perfect. Ryba found it interesting that, for g € M of prime order p, Norton’s 
series N(g,h) is a McKay—Thompson series (and has all the associated nice properties) 
whenever h is p-regular (i.e. h has order coprime to p). This special behaviour of p-regular 
elements suggested to him to look at modular representations. 

The basics of modular representations and Brauer characters are discussed in sufficient 
detail in Chapter 2 of [31]. A modular representation p of a group G is a representation 
defined over a field of positive characteristic p dividing the order |G| of Œ. Such represen- 
tations possess many special (i.e. unpleasant) features. For one thing, they are no longer 
completely reducible (so the role of irreducible modules as direct summands will be re- 
placed with their role as composition factors). For another, the usual notion of character 
(the trace of representation matrices) loses its usefulness and is replaced by the more subtle 
Brauer character 3(p): a complex-valued class function on M which is only well-defined 
on the p-regular elements of G. 


THEOREM 7. [105], [17], [13] Let g © M be any element of prime order p, for any p 
dividing |M|. Then there is a vertex operator superalgebra 9V = Bnez’ Vn defined over the 
finite field Fp and acted on by the centraliser Cm(g). If h € Cu(g) is p-regular, then the 
graded Brauer character 


Rigo" > CCValhya" 


nEZ 


equals the McKay-Thompson series Tgn(T). Moreover, for g belonging to any conjugacy 
class in M except 2B, 3B, 5B, 7B, or 18B, this is in fact an ordinary VOA (i.e. the ‘odd’ 
part vanishes), while in the remaining cases the graded Brauer characters of both the odd 
and even parts can separately be expressed using McKay—Thompson series. 


By a vertex operator superalgebra, we mean there is a Zo-grading into even and odd 
subspaces, and for u,v both odd the commutator in (4.2a) is replaced by an anticommu- 
tator. In the proof, the superspaces arise as cohomology groups, which naturally form an 
alternating sum. The centralisers Cm(g) in the Theorem are quite nice: e.g. for g in classes 
2A, 2B, 3A, 3B, 3C, 5A, 5B, 7A, 11A, respectively, these involve the sporadic groups B, 
Coi, Fih, Suz, Th, HN, HJ, He, and Miz. The proof for p = 2 is not complete at the 
present time. The conjectures in [105] concerning modular analogues of the Griess algebra 
for several sporadics follow from Theorem 7. 

Can these modular Vs be interpreted as a reduction mod p of (super)algebras in 
characteristic 0? Also, what about elements g of composite order? 


CONJECTURE 8. [13] Choose any g E€ M and let n denote its order. Then there is 
a +7Z-graded superspace IV = DjcizIV; over the ring of cyclotomic integers Lien, It 
is often (but probably not always) a vertex operator superalgebra — in particular 1Y is an 


integral form of the Moonshine module V”. Each IY carries a representation of a central 
extension of Cm(g) by Cn. Define the graded trace 


Big, hit) =47* D> chs (AG . 


i€4Z 
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Ifg,h € M commute and have coprime orders, then B(g, h; T) = Tgn(7). If all q-coefficients 
of T, are nonnegative, then the ‘odd’ part of 9V vanishes, and 9V is the g-twisted module 


V4(g) of [36]. If g has prime order p, then the reduction mod p of 9V is the modular vertex 
operator superalgebra 9V of Theorem 7. 


When we say 1) is an integral form for V", we mean that 1) has the same structure 
as a VOA, with everything defined over Z, and tensoring it with C recovers V”. This 
remarkable conjecture, which tries to explain Theorem 7, is completely open. 


5.5. The geometry of Moonshine. Algebra is the mathematics of structure, and so of 
course it has a profound relationship with every area of mathematics. Therefore the trick 
for finding possible fingerprints of Moonshine in say geometry is to look there for modular 
functions. And that search quickly leads to the elliptic genus. 

For details see e.g. [55], [108], [112]. All manifolds here are compact, oriented and 
differentiable. In Thom’s cobordism ring Q, elements are equivalence classes of cobordant 
manifolds, addition is connected sum, and multiplication is Cartesian product. The uni- 
versal elliptic genus ¢(M) is a ring homomorphism from Q & Q to the ring of power series 
in q, which sends n-dimensional manifolds with spin connections to a weight n/2 modular 
form of ['9(2) with integer coefficients. Several variations and generalisations have been 
introduced, e.g. the Witten genus assigns spin manifolds with vanishing first Pontryagin 
class a weight n/2 modular form of SL2(Z) with integer coefficients. 

Several deep relationships between elliptic genera and the general material reviewed 
elsewhere in this paper, have been uncovered. For instance, the important rigidity property 
of the Witten genus with respect to any compact Lie group action on the manifold, is a 
consequence of the modularity of the characters of affine algebras (our Theorem 4) [81]. 
The elliptic genus of a manifold M has been interpreted as the graded dimension of a 
vertex operator superalgebra constructed from M [111]. Seemingly related to this, [18] 
recovered the elliptic genus of a Calabi-Yau manifold X from the sheaf of vertex algebras 
in the chiral de Rham complex [85] attached to X. Unexpectedly, the elliptic genus of 
even-dimensional projective spaces P?” has nonnegative coefficients and in fact equals the 
graded dimension of a certain vertex algebra [86]; this suggests interesting representation- 
theoretic questions in the spirit of Monstrous Moonshine. In physics, elliptic genera arise 
as partition functions of N = 2 superconformal field theories [120]. Mason’s constructions 
[88] associated to Moonshine for the Mathieu group Mə4 have been interpreted as providing 
a geometric model (‘elliptic system’) for elliptic cohomology Ell*(BMgoz,) of the classifying 
space of M4 [112], [39]. The Witten genus (normalised by 7°) of the Milnor—Kervaire 
manifold Mọ, an 8-dimensional manifold built from the Eg diagram, equals j3 [55] (recall 
(3.5)). 

Hirzebruch’s ‘prize question’ (p.86 of [55]) asks for the construction of a 24-dimensional 
manifold M with Witten genus J (after being normalised by n?*). We would like M to act 
on M by diffeomorphisms, and the twisted Witten genera to be the McKay—Thompson 
series Ty. It would also be nice to associate Norton’s series N(g,h) to this Moonshine 
manifold. Constructing such a manifold is perhaps the remaining Holy Grail of Monstrous 
Moonshine. 

Hirzebruch’s question was partially answered by Mahowald and Hopkins [84], who 


23 


constructed a manifold with Witten genus J, but couldn’t show it would support an 
effective action of M. Related work is [3], who constructed several actions of M on e.g. 
24-dimensional manifolds (but none of which could have genus J), and [73], who showed 
the graded dimensions of the subspaces vi A. 


of the Moonshine module are twisted A-genera 


of Milnor-Kervaire’s manifold Mọ (the A-genus is the specialisation of elliptic genus to 
the cusp ioo). 

There has been a second conjectured relationship between geometry and Monstrous 
Moonshine. Mirror symmetry says that most Calabi-Yau manifolds come in closely related 
pairs. Consider a 1-parameter family X, of Calabi-Yau manifolds, with mirror X* given 
by the resolution of an orbifold X/G for G finite and abelian. Then the Hodge numbers 
ht1(X) and h?:+(X*) will be equal, and more precisely the moduli space of (complexified) 
Kahler structures on X will be locally isometric to the moduli space of complex structures 
on X*. The ‘mirror map’ z(q), which can be defined using the Picard—Fuchs equation [95], 
gives a canonical map between those moduli spaces. For example, xj + v3 + 23 + x4 + 
z7 14x zoz3x4 = 0 is such a family of K3 surfaces, where G = C4 x C4. Its mirror map is 
given by 


2(q) = q — 104q? + 64440? — 311744q* + 13018830q° — 493025760q° + --- . (5.6) 


Lian—Yau [80] noticed that the reciprocal 1/z(q) of the mirror map in (5.6) equals the 
McKay—Thompson series T,(7) +104 for g in class 2A of M. After looking at several other 
examples with similar conclusions, they proposed their Mirror-Moonshine Conjecture: The 
reciprocal 1/z of the mirror map of a 1-parameter family of K3 surfaces with an orbifold 
mirror, will be a McKay-Thompson series (up to an additive constant). 

A counterexample (and more examples) are given in §7 of [118]. In particular, al- 
though there are relations between mirror symmetry and modular functions (see e.g. [51] 
and [54]), there doesn’t seem to be any special relation with the Monster. Doran [40] ‘de- 
mystifies the Mirror-Moonshine phenomenon’ by finding necessary and sufficient conditions 
for 1/z to be a modular function for a modular group commensurable with SL2(Z). 


6. The physics of Moonshine 


The physical side (perturbative string theory, or equivalently conformal field theory) 
of Moonshine was noticed early on, and has profoundly influenced the development of 
Moonshine and VOAs. This is a very rich subject, which we can only superficially touch 
on. The book [32], with its extensive bibliography, provides an introduction but will be 
difficult reading for many mathematicians (as will this section!) The treatment in [45] is 
more accessible and shows how naturally VOAs arise from the physics. This effectiveness of 
physical interpretations isn’t magic — it merely tells us that many of our finite-dimensional 
objects are seen much more clearly when studied through infinite-dimensional structures 
(often by being ‘looped’). Of course Moonshine, which teaches us to study the finite group 
M via its infinite-dimensional module V", fits perfectly into this picture. 

A conformal field theory (CFT) is a quantum field theory on 2-dimensional space-time, 
whose symmetries include the conformal transformations. In string theory the basic objects 
are finite curves (‘strings’) rather than points (‘particles’), and the CFT lives on the surface 
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traced by the strings as they evolve (colliding and separating) through time. Each CFT is 
associated with a pair Vz, Vr of mutually commuting VOAs, called its chiral algebras [6]. 
For example, strings living on a compact Lie group manifold (the so-called Wess—Zumino— 
Witten model) will have chiral algebras given by affine algebra VOAs. The space H of 
states for the CFT carries a representation of Vz ® Vp, and many authors have (somewhat 
optimisticly) concluded that the study of CFTs reduces to that of VOA representation 
theory. Rational VOAs correspond to the important class of rational CFTs, where H 
decomposes into a finite sum Mz & Mp of irreducible modules. The Virasoro algebra 
(4.3) arises naturally in CFT through infinitesimal conformal transformations. The vertex 
operator Y (¢, z), for the space-time parameter z = etti? is the quantum field which creates 
from the vacuum |0) € H the state |¢) € H at time t = —oo: |) = lim,_.oY(¢, z) |0}. In 
particular, Borcherds’ definition [9] of VOAs can be interpreted as an axiomatisation of 
the notion of chiral algebra in CFT, and for this reason alone is important. 


In CFT, the Hauptmodul property of Moonshine is hard to interpret, and a less 
direct formulation like that in [116] is needed. However, both the statement and proof of 
Theorem 5 are natural from the CFT framework (see [45]) — e.g. the modularity of the 
series Tọ and N(g,h) are automatic in CFT. This modularity arises in CFT through the 
equivalence of the Hamiltonian formulation, which describes concretely the graded spaces 
we take traces on (and hence the coefficients of our g-expansions), and the Feynman path 
formalism, which interprets these graded traces as sections over moduli spaces (and hence 
makes modularity manifest). Beautiful reviews are sketched in [119], [120]. 


Because V4 is so mathematically special, it may be expected that it corresponds to 
interesting physics. Certainly it has been the subject of some speculation. There will be 
ac = 24 rational CFT whose chiral algebra Vz, and state space H are both V”, while 
Vp is trivial (this is possible because V4 is holomorphic). This CFT is nicely described 
in [34]; see also [35]. The Monster is the symmetry of that CFT, but the Bimonster 
M? Cə will be the symmetry of a rational CFT with H = V! @ VI. The paper [27] finds 
a family of D-branes for the latter theory which are in one-to-one correspondence with 
the elements of M, and their ‘overlaps’ ((g||q2(£°t£~ 22) ||h)) equal the McKay—Thompson 
series T,-1,. However, we still lack any explanation as to why a CFT involving V’ should 
yield interesting physics. 

Almost every facet of Moonshine finds a natural formulation in CFT, where it often 
was discovered first. For example, the ‘No-Ghost’ Theorem of Brower—Goddard—Thorn 
was used to great effect in [11] to understand the structure of the Monster Lie algebra m. 
On a finite-dimensional manifold M, the index of the Dirac operator D in the heat kernel 
interpretation is a path integral in supersymmetric quantum mechanics, i.e an integral over 
the free loop space LM = {y : St — M}; the string theory version of this is that the index 
of the Dirac operator on LM should be an integral over L(LM), i.e. over smooth maps 
of tori into M, and this is just the elliptic genus, and explains why it should be modular. 
The orbifold construction of [36] comes straight from CFT (although [43]’s construction of 
V’ predates CFT orbifolds by a year and in fact influenced their development in physics). 
That said, the translation process from physics to mathematics of course is never easy — 
Borcherds’ definition [9] is a prime example! 


But from this standpoint, what is most exciting is what hasn’t yet been fully exploited. 
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String theory tells us that CFT can live on any surface X}. The VOAs, including the 
geometric VOAs of [59], capture CFT in genus 0. The graded dimensions and traces 
considered above concern CFT quantities (‘conformal blocks’) at genus 1: T +> e?7'7 maps 
HI onto a cylinder, and the trace identifies the two ends. But there are analogues of all this 
at higher genus [123] (though the formulas can rapidly become awkward). For example, 
the graded dimension of e.g. the V! CFT in genus 2 is computed in [117], and involves 
e.g. Siegel theta functions. The orbifold theory in §5.1 is genus 1: each ‘sector’ (g, h) 
corresponds to a homomorphism from the fundamental group Z? of the torus into the 
orbifold group G (e.g G = M) — g and h are the targets of the two generators of Z? and 
hence must commute. More generally, the sectors will correspond to each homomorphism 
Y : 71(%) — G, and to each we will get a higher genus trace Z(y), which will be a function 
on the Teichmüller space T} (generalising the upper half-plane H for genus 1). The action 
of SL2(Z) on the N(g,h) generalises to the action of the mapping class group on 7(™) 
and T}. See e.g. [4] for some thoughts in this direction. 


7. Conclusion 


There are different basic aspects to Monstrous Moonshine: (i) why modularity enters 
at all; (ii) why in particular we have genus 0; and (iii) what does it have to do with the 
Monster. We understand (i) best. There will be a Moonshine-like relation between any 
(subgroup of the) automorphism group of any rational VOA, and the characters xm, and 
the same can be expected to hold of the orbifold characters Z in 85.1. 

To prove the genus 0 property of the T}, we needed recursions obtained one way or 
another from the Monster Lie algebra m, and from these we apply Theorem 6. These 
recursions are very special, but so presumably is the genus 0 property. The suggestion of 
[20] though is that we may be able to considerably simplify this part of the argument. 

Every group known to have rich Moonshine properties is contained in the Monster. 
Our understanding of this seemingly central role of M is the poorest of those three aspects. 

It should be clear from this review, of the central role VOAs play in our current under- 
standing of Moonshine. The excellent review [39] makes this point even more forcefully. It 
can be (and has been) questioned though whether the full and difficult machinery of VOAs 
is really needed to understand this, i.e. whether we really have isolated the key conjunction 
of properties needed for Moonshine to arise. CFT has been an invaluable guide thus far, 
but perhaps we are a little too steeped in its lore. 

Moonshine (in its more general sense) is a relation between algebra and number the- 
ory, and its impact on algebra has been dramatic (e.g. VOAs, V’, Borcherds-Kac-Moody 
algebras). Its impact on number theory has been far less so. This may merely be a tem- 
porary accident due to the backgrounds of most researchers (including the mathematical 
physicists) working to date in the area. But the most exciting prospects for the future of 
Moonshine (in this writer’s opinion) are in the direction of number theory. Hints of this 
future can be found in e.g. [121], [41], [14], [33], [52], [94]. 
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